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Title: HIGH SPEED SOFTWARE DRIVEN EMULATOR COMPRISED OF A 
PLURALITY OF EMULATION PROCESSORS WITH A METHOD 
TO ALLOW HIGH SPEED BULK READ/WRITE OPERATION 
SYNCHRONOUS DRAM WHILE REFRESHING THE MEMORY 
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the present invention, contain common disclosure and are 
incorporated herein by reference in their entireties: 

"High Speed Software Driven Emulator Comprised of a Plurality of 
10 Emulation Processors with Improved Board-to-Board Interconnection 

C3 Cable Length Identification System/' Serial No. , filed 

Jj , (Attorney Docket No. POU9-2000-0045-US1) . 

i & "High Speed Software Driven Emulator Comprised of a Plurality of 

-2 Emulation Processors with an Improved Maintenance Bus that 

J5 Streams Data at High Speed/' Serial No. , filed 

C3 (Attorney Docket No. POU9-2000-0046-US1) . 



W "High Speed Software Driven Emulator Comprised of* a Plurality of 
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"High Speed Software Driven Emulator Comprised of a Plurality of 
Emulation Processors with Improved Multiplexed Data Memory/' 

Serial No. , filed , (Attorney Docket No. 

POU9-1999-0183-US1) . 
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Field of the Invention: 

This invention relates to a software driven emulator comprised of 
a large number of single bit processors operating in parallel to 
execute, at high speed, an emulation of a complex processor, and 
5 more particularly, to a system and method for high-speed 
streaming of bulk read and write operations from and to the 
synchronous DRAM module memories in order to support a 
simulation-accelerate mode of operation. 

Trademarks : 

10 S/390 and IBM are registered trademarks of International Business 

U Machines Corporation, Armonk, New York, U.S.A. and Lotus is a 

m registered trademark of its subsidiary Lotus Development 

*il Corporation, an independent subsidiary of International Business 

Lii Machines Corporation, Armonk, NY. Other names may be registered 

|5 trademarks or product names of International Business Machines 

s Corporation or other companies. 

O Background: 

O The usefulness of software driven emulators has increased 
enormously with growth in the complexity of integrated circuits. 

20 Basically, an emulation engine operates to mimic the logical 
design of a set of one or more integrated circuit chips. The 
emulation of these chips in terms of their logical design is 
highly desirable for several reasons. The utilization of 
emulation engines has also grown up with and around the 

25 corresponding utilization of design automation tools for the 
construction and design of integrated circuit chip devices. In 
particular, as part of the input for the design automation 
process, logic descriptions of the desired circuit chip functions 

POU9-20 00-004 7 -US1 
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are provided. The existence of such software tools for 
processing these descriptions in the design process is well 
suited to the utilization of emulation engines which are 
electrically configured to duplicate the same logic function that 
5 is provided by a design automation tool. 

Utilization of emulation devices permits testing and verification 
via electrical circuits of logic designs before these designs are 
committed to a socalled "silicon foundry" for manufacture. The 
input to such foundries is the functional logic description 

10 required for the chip and its output is initially a set of 
photolithographic masks which are then used in the manufacture of 
the desired electrical circuit chip device. Verifying that logic 

H designs are correct in the early stage of chip manufacturing 

Ifl eliminates the need for costly and timeconsuming second passes 

1$ through a silicon foundry. 

" Another advantage of emulation systems is that they provide a 
s device that makes possible the early validation of software meant 
" : 2 to operate the emulated chips. Thus, software can be designed, 
C3 evaluated and tested well before the time when actual circuit 
JO chips become available. Additionally, emulation systems can also 
C3 operate as simulator-accelerator devices thus providing a 
highspeed simulation platform. 

Emulation engines of the type contemplated by this invention 
contain an interconnected array of emulation processors (EP) . 

25 Each emulation processor (hereinafter , also sometimes simply 
referred to as "processor") can be programmed to evaluate logic 
function (for example, AND, OR, XOR, NOT, NOR, NAND, etc.). The 
program driven processors operate together as an interconnected 
unit, emulate the entire desired logic design. However, as 

30 integrated circuit designs grow in size, more emulation 
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processors are required to accomplish the emulation task. An aim, 
therefore, is to increase the capacity of emulation engines in 
order to meet the increasingly difficult task of emulating more 
and more complex circuits and logic functions by increasing the 
5 number of emulation processors in each of its modules. 

For purposes of better understanding the structure and operation 
of emulation devices generally, and this invention particularly, 
United States Patent No. 5,551,013 and patent application Serial 
No. 09/373,125 filed August 12, 1999, both of which are assigned 
10 to the assignee of this application, are hereby incorporated 
herein by reference. 

H. Patent No. 5,551,013 shows an emulation chip, called a module 
SIl here, having multiple (e.g. 64) processors. All processors 

within the module are identical. The sequencer and the 
15 interconnection network occurs only once in a module. The 
77 control stores hold a program created by an emulation compiler 
s for a specified processor. The stacks hold data and inputs 

previously generated and are addressed by fields in a 

C3 corresponding control word to locate the bits for input to the 

^ pi 

2fi logic element. During each step of the sequencer an emulation 

C3 processor emulates a logic function according to the emulation 
program. A data flow control interprets the current control word 
to route and latch data within the processor. The node-bit-out 
signal from a specified processor is presented to the 

25 interconnection network where it is distributed to each of the 
multiplexors (one for each processor) of the module. The node 
address field in the control word allows a specified processor to 
select for its node-bit-in signal the node-bit-out signal from 
any of the processors within its module. The node bit is stored 

30 in the input stack on every step. During any operation the 
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node-bit-out signal of a specified processor may be accessed by 
none, one, or all of the processors within the module. 

Data routing within each processor's data flow and through the 
interconnection network occurs independently of and overlaps the 
5 execution of the logic emulation function in each processor. 
Each control store stores control words executed sequentially 
under control of the sequencer and program steps in the 
associated module. Each revolution of the sequencer causes the 
step value to advance from zero to a predetermined maximum value 

10 and corresponds to one target clock cycle for the emulated 
design. A control word in the control store is simultaneously 
selected during each step of the sequencer. A logic function 

k Z operation is defined by each control word. 

Each of these emulation processors has an execution unit for 

•Li i 

IS processing multiple types of logic gate functions. Each 
r? emulation processor switches from a specified one logic gate 
s function to a next logic gate function in a switched-emulation 
h Z sequence of different gate functions. The switched-emulation 
p sequence of each of the processors thus can emulate a subset of 
|$ gates in a hardware arrangement in which gates are of any type 
S3 that the emulation processors functionally represent for a 
sequence of clock cycles. The processors are coupled by a like 
number of multiplexors having outputs respectively connected to 
the emulation processors of a module and having inputs 
25 respectively connected to each of the other emulation processors. 
The bus connected to the multiplexors enables an output from any 
emulation processor to be transferred to an input of any other of 
the emulation processors. In accordance with the teachings of 
the pending application, the basic design of the 5,551,013 patent 
30 is improved by interconnecting processors into clusters. With 
interconnected clusters, the evaluation phases can be cascaded 
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and all processors in a cluster perform the setup and storing of 
results in parallel. This setup includes routing of the data 
through multiple evaluation units for the evaluation phase. For 
most efficient operation, the input stack and data stack of each 
5 processor must be stored in shared memory within each cluster. 
Then, all processors perform the storage phase, again in 
parallel. The net result is multiple cascaded evaluations 
performed in a single emulation step. Every processor in a 
cluster can access the input and data stacks of every other 

10 processor in the cluster and less space on each module chip for 
the functions that support the processor operation, particularly 
the memory functions. 

1 1 As will be appreciated by those skilled in the art, emulators of 
CJl the type described above have evolved to perform not only in a 
IJf traditional emulation mode but also in a simulation-accelerate 
ifj mode. In this simulation-accelerate mode, there is a requirement 
r" to upload and download large quantities of data from the system 
* SDRAMs for the data capture function of the simulation-accelerate 
^ operating mode. 

SO In the prior art emulators, such as the ET 3.5 and ET 3.7 
O emulators, the protocol for writing data to and reading data from 
SDRAMS requires a hand shake protocol. A word is transferred from 
or to the SDRAM only in response to a "done" signal from the 
memory signaling that the previous transfer operation has been 
25 completed. The "done" signal is required in these prior art 
systems to account for the case where the previous transfer 
operation was delayed by a memory refresh operation. Such prior 
art protocols slow the bulk transfer of data to and from an SDRAM 
memory . 

30 Summary of the Invention: 
POU9-2000-0047-US1 
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An object of this invention is the provision of a system and 
method for rapidly transferring large quantities of data to and 
from SDRAM memories in an emulator in order to support 
simulation-acceleration operating mode, 

5 Briefly, this invention contemplates the provision of a system 
■ and method for bulk transfer to and from the SRAMs in which a 
starting memory address is latched and is then incremented every 
clock cycle to generate a new memory address, as described more 

extensively in copending application Serial No. , filed 

10 (POU9-2000-0048-US1) and assigned to the assignee of this 

application. The addresses are decoded and memory requests are 
H pipelined to the SRAM memory, one every clock cycle. When the 
Cn memory controller detects transfer of the boundary of a 
;i predetermined number of clock cycles or words (e.g. 64 words or 
WS four clock cycles) the burst mode of data transfer is stopped and 
[7 the memory controller waits for a "done" signal before resuming 
s another cycle of the burst transfer mode. The memory controller 
*g on detecting a request on this address boundary first does a 
O memory refresh followed by a requested operation; e.g. a 
||p continuation of the transfer operation. 

Brief Description of the Drawings: 

The foregoing and other features and advantages of the invention 
will be described with reference to the accompanying drawings. 
In the drawings, like reference numbers generally indicate 
25 identical, functionally similar, and/or structurally similar 
elements. Also in the figures, the left most digit of each 
reference number corresponds to the figure in which the reference 
number is first used. 



POU9-2000-0047-US1 



8 



Figure 1 is a high level block diagram of two of 64 modules on an 
emulator board of the type to which this invention is applicable. 

Figure 2 is a flow diagram of the method steps of this invention. 

Detailed Description of the Invention: 

5 Referring now to Figure 1, each board 14 of the emulation 
processor in this exemplary embodiment has sixty-four modules 
12, two of which are illustrated in Figure 1. Each module has an 
E4 chip 16, which includes sixty-four single-bit processors. Each 
module has a main memory 18 comprised of two SDRAMs and a single 
10 SSRAM. These memories communicate internally with the E4 chip 16 
^ • on the module to which these memories are attached, via an 
ui internal module bus. A maintenance bus connects these module 
«L; main memories to other modules on the same board, to modules on 
Ul other boards, and to a work station 26, which serves the entire 
15 emulation engine. There is one memory controller 22 for each 
s board. The memory controller 22 interfaces the reading and 
H writing of data from and into the SDRAMs and SSRAMs in a data 
C3 capture mode of operation in which bulk transfers of data between 
% the work station and the SDRAMs and SSRAMs are carried out via 
El the maintenance bus. 

For bulk transfers in accordance with the teachings of this 
invention, a latch is set by one of the ET4 control stores to 
inhibit further transfers between the ET4 processor chip and the 
SDRAMs on the module, as described in detail in copending 

25 application serial number , (POU9-2000-0048-US1) . The 

memory controller 22 includes incrementing logic into which a 
starting SDRAM address is inserted when the emulator is to 
perform a bulk data transfer operation. This memory address in 
incremented by M 1" in response to each clock signal. Similarly, 
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with the latch set, the memory controller makes a memory request 
(i.e. read or write) to the incremented address in response every 
clock cycle. Periodically, the memory controller, after making a 
memory request, waits for the memory "done" return before making 
5 the next memory request. This periodic pause in the streamed bulk 
memory transfer allows time for the SDRAM refresh cycle, the 
completion of which is signaled by the "done" return signal. As 
will be appreciated by those skilled in the art, the minimum 
frequency at which there is a pause in the bulk streaming 

10 operation is a function of the frequency at which the SDRAM must 
be refreshed. In one specific example, an emulator of the type 
described herein uses commercially available SRAMs, has a clock 
cycle of 90 ns, and transfers data to and from the SDRAM on each 

H clock cycle as a sixteen-bit word for each memory address. The 

p memory controller detects each 64 word boundary and stops the 
burst mode of data transfer while the controller waits for a 

ifl "done" return from the memory in order to resume the burst mode. 

71 Upon detecting the 64 word boundary, the memory controller 

h initiates a refresh cycle. 

Wd Referring now to Figure 2 of the drawings, in a bulk data 
~ transfer to the module memories, a latch is set (block 30) and a 
C3 starting address is entered into a register of the memory 
controller (block 32) . A sixteen bit data word is transferred 
each clock cycle (block 34) the address is incremented (block 36) 
25 and another word is transferred to or from this incremented 
address on the next clock cycle (block 37) . After a 

predetermined number of clock cycles or on a predetermined word 
boundary (e.g. on each 64 word boundary) (block 38) the bulk 
transfer is halted while the memory controller waits for a "done" 
30 return (block 40). When the "done" (block 39) signal is 
returned, the controller resumes the streaming mode of operation, 
which continues until the next 64 word boundary is detected. 

POU9-2000-0047-US1 
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While the preferred embodiment to the invention has been 
described, it will be understood that those skilled in the art, 
both now and in the future, may make various improvements and 
enhancements which fall within the scope of the claims which 
follow- These claims should be construed to maintain the proper 
protection for the invention first described. 
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What is claimed is: 



1 Claim 1. In a software driven emulator comprised of a 

2 plurality of modules on printed circuit boards, each of said 

3 modules including a processor chip and at least one SDRAM 

4 coupled to the processor chip, a maintenance bus coupled to 

5 said SDRAM, and a memory controller coupled to said 

6 maintenance bus, a method executing bulk data transfers to 

7 said SDRAM via said maintenance bus, including the steps of: 

8 transferring data to said SDRAM via said 

9 maintenance bus on each clock cycle for a predetermined 

10 number of clock cycles in succession; 

11 halting the transfer of data after said 

12 predetermined number of data transfers; 

13 initiating a SDRAM refresh cycle after said 

14 halting step; 

15 resuming said transferring step upon receipt 

16 of a done signal after said refresh cycle. 

1 Claim 2. In a software driven emulator comprised of a 

2 plurality of modules on printed circuit boards, each of said 

3 modules including a processor chip and at least one SDRAM 

4 coupled to the processor chip, a maintenance bus coupled to 

5 said SDRAM, and a memory controller coupled to said 

6 maintenance bus, a method executing bulk data transfers to 

7 said SDRAM via said maintenance bus, including the steps of: 

8 transferring data from said SDRAM via said 

9 maintenance bus on each clock cycle for a predetermined 

10 number of clock cycles in succession; 

11 halting the transfer of data after said 

12 predetermined number of data transfers; 

13 initiating a SDRAM refresh cycle after said 

14 halting step; 
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15 resuming said transferring step upon receipt 

16 of a done signal after said refresh cycle. 

1 Claim 3. A method of executing bulk transfers as in claim 1 

2 including establishing a starting address for said bulk 

3 transfer in said memory controller and incrementing said 

4 starting address by one on each clock cycle. 

1 Claim 4. A method of executing bulk transfers as in claim 2 

2 including establishing a starting address for said bulk 

3 transfer in said memory controller and incrementing said 

4 starting address by one on each clock cycle. 

1 Claim 5. A method of executing bulk transfers as in claim 1 

2 wherein a data word is transferred on each clock cycle. 

1 Claim 6. A method of executing bulk transfers as in claim 2 

2 wherein a data word is transferred on each clock cycle. 
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Title: HIGH SPEED SOFTWARE DRIVEN EMULATOR COMPRISED OF A 

PLURALITY OF EMULATION PROCESSORS WITH A METHOD 
TO ALLOW HIGH SPEED BULK READ/WRITE OPERATION 
SYNCHRONOUS DRAM WHILE REFRESHING THE MEMORY 

5 

Abstract of the Disclosure: 

A system and method for bulk transfer to and from the SRAMs in 
which a starting memory address is latched and is then 
incremented every clock cycle to generate a new memory address. 
10 The addresses are decoded and memory requests are pipelined to 
the SRAM memory, one every clock cycle. When the memory 
controller detects transfer of the boundary of a predetermined 
number of clock cycles or words (e.g. 64 words or four clock 
O cycles) the burst mode of data transfer is stopped and the memory 
|J controller waits for a "done" signal before resuming another 
IM cycle of the burst transfer mode. The memory controller on 
Yl detecting a request on this address boundary first does a memory 
»» refresh followed by a requested operation; e.g. a continuation of 
r ~ the transfer operation. 

5.3 
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(Status) (patented, pending, abandoned) 
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Lynn L. Augspurger, Reg. No. 24,227; Lawrence D. Cutter, Reg. No. 28,501; Marc 
A. Ehrlich, Reg. No. 39,966; Floyd A. Gonzalez, Reg. No. 26,732; William A. 
Kinnaman, Jr., Reg. No. 27,650; Lily Neff, Reg. No. 38,254; William B. Porter, 
Reg. No. 33,135; Christopher A. Hughes, Reg. No. 26,914; Edward A. Pennington, 
Reg. No. 32,588; John E. Hoel, Reg. No. 26,279; Joseph C. Redmond, Jr., Reg. 
No. 18,753; Andrew J. Wo jnicki, Jr . , Reg. No. 43,995 Richard L. Aitken, Reg. 
No. 18,791; Clifton E. McCann, Reg. No. 29,565, John P. Shannon, Reg. No. 
29,276; Laurence J. Marhoefer, Reg. No. 21,091; Andrew C. Aitken, Reg. No. 
36,729; and Ralph P. Albrecht, Reg. No. 43,466. 

Send Correspondence to: Laurence J. Marhoefer 

VENABLE 

P. 0. Box 34385 

Washington, D.C. 20045-9998 

Telephone: (202) 962-4800 

I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information and belief are believed to be 
true; and further that these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 

Full Name of sole or first inventor: William F. BEAU SOLE I L 

S^^^ ?^,^JL^ Date: ^/^//^^ 

Residence: 21 Bykenhulle Road, Hopewell Junction, New York 12533 

Citizenship: United States 

Post Office Address: Same as above 

Full Name of second joint inventor: Bryan R. Ce eK * *5r^<svv> * * 

Signature : ^ ^Pl/^ -^ Date: ^ / S~ / 1L & O C 

Residence: 29 Rourfti Hill Road, Poughkeepsie, New York 12603 
Citizenship: United States 
Post Office Address: Same as above 

Full Name of third joint inventor: Tak-kwong NG 

Signature: ^&i^^ )^^V 0 V Date: ^[^ K \ 

Residence: 118 RooseWlt RoJA, Hyde Park, New York 12538 

Citizenship: United States 

Post Office Address: Same as above 



Full Name of fourth joint inventor: Helmut ROTH 




Signature: \J^J^*^^A Date: 

Residence: 69 Dakota Drive, Hopewell Junction, New York 12533 

Citizenship: German 

Post Office Address: Same as above 



9/f I qoo 
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Full Name of fifth joint inventor: Peter TANNENBAUM 



Signat' 




Citizenship: United States 

Post Office Address: P, O. Box 172, Woodstock, New York 124 98 



Full Name of sixth joint inventor: Lawrence A. THOMAS 

Signatu^K^J^^J'^^^ ^ t ^J^^h^^L^ Date: ^ C5 Q 

Residence: 100 Pleasant Ridge Drive, West Hurley, New York 12491 ' ' 

Citizenship: United States 

Post Office Address: Same as above 



Full Name of seventh joint inventor: 



Norton J. TOMASSETTI 



Signature: 




Date: 



Residence: 169 D&rsell Street, Kingston, New York 12401 
Citizenship: ynited States 
Posst Office $&dress: Same as above 




